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ABSTRACT: We describe a copy-number variant (CNV) 
for which deletion alleles confer a protective affect against 
rheumatoid arthritis (RA). This CNV reflects net unit 
deletions and expansions to a normal two-unit tandem du- 
plication located on human chrl2p 13.31, a region with 
conserved synteny to the rat RA susceptibility quantita- 
tive trait loci Oia2. Genotyping, using the paralogue ratio 
test and SNP intensity data, in Swedish samples (2,403 
cases, 1,269 controls) showed that the frequency of dele- 
tion variants is significantly lower in cases (P = 0.0012, 
OR = 0.442 [95%CI 0.258-0.755]). Reduced frequen- 
cies of deletion variants were also seen in replication ma- 
terials comprising 9,201 UK samples (1,846 cases, 7,355 
controls) and 2,963 US samples (906 controls, 1,967 
cases) (Mantel-Haenszel P = 0.036, OR = 0.559 [95%CI 
0.323-0.966]). Combining the three datasets produces a 
Mantel-Haenszel OR of 0.497 (P < 0.0002). The dele- 
tion variant lacks 129-kb of DNA containing SLC2A3, 
NANOQPl, and SLC2A14. SLC2A3 encodes a high- 
affinity glucose transporter important in the immune re- 
sponse and chondrocyte metabolism, both key aspects of 
RA pathogenesis. The large effect size of this association, 
its potential relevance to other diseases in which SLC2A3 
is implicated, and the possibility of targeting drugs to in- 
hibit SLC2A3, argue for further examination of the genet- 
ics and the biology of this CNV. 
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Introduction 

Rheumatoid arthritis (RA) is a chronic autoimmune disease that 
manifests as inflammation of the synovium and severe joint dam- 
age, along with other complications such as cardiovascular disease. 
It affects approximately 1% of the global population, predominantly 
women and the elderly, and is treated symptomatically as there is 
currently no cure. The inflammation of synovial joints in RA leads 
to hyperplasia of the synovial cells, excess synovial fluid, and the 
development of pannus (an inflammatory granulation tissue). Per- 
sistent synovitis leads to the destruction of articular cartilage and 
subsequent debilitating bone damage. Environmental factors, such 
as smoking, play a role in RA risk [Klareskog et al., 2006; Morgan 
et al., 2009; Silman et al., 1996] . However, around 60% of the over- 
all risk is attributable to genetic factors [MacGregor et al., 2000], 
approximately one-third of which is conferred by shared epitope 
HLA alleles within the major histocompatibility complex (MHC) 
[Gregersen et al., 1987]. A number of other risk loci have been 
identified, particularly since the advent of SNP-based studies, in- 
cluding PTPN22 (MIM #600716) [Begovich et al, 2004; Gregersen 
et al., 2006], STAT4 (MIM #600558) [Remmers et al., 2007], C5 
(MIM #120900)/ rJMFJ (MIM #601711) [Plenge et al., 2007], and 
TNFAIP3 (MIM #191 163) [Thomson et al, 2007]. Including recent 
data from a large meta-analysis of GWAS RA studies, the number 
of confirmed genetic risk loci is 46 [Eyre et al., 2012; Stahl et al., 
2010]. However, these loci contribute relatively modest per locus 
effect sizes to RA susceptibility (OR 2 1.8), leaving much of the 
genetic risk unaccounted for [as reviewed in Raychaudhuri, 2010]. 
The remaining genetic risk could be due to other types of varia- 
tion not routinely investigated such as rare single-nucleotide alleles 
[Dickson et al., 2010], epigenetic modifications, and copy-number 
variation (CNV). Latest estimates of CNV suggest that up to 16% 
of the genome is commonly copy-number variable [Conrad et al., 
2010; Itsara et al., 2009; Mills et al., 2011; Redon et al., 2006]. 
There is increasing evidence for the involvement of CNV in disease 
susceptibility, not least for autoimmune diseases such as systemic 
lupus erythematosus [Yang et al., 2007] and psoriasis [HoUox et al., 
2008]. Copy-number changes of the CCL3L1 (MIM #601395) gene 
have previously been shown to be associated with RA susceptibil- 
ity and HIV progression [Gonzalez et al., 2005; McKinney et al., 
2008]. 

Previous work by others and us used oil-induced arthritis rat 
models and linkage analysis to discover RA quantitative trait loci: 
Oial that contains the MHC genes, and 0/fl2 that maps to a 1.2-Mb 
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interval on rat chromosome 4q42 [Jansson et al., 1999; Lorentzen 
et al., 1998; Ribbhammar et al., 2003]. The rat Oia2 region shows 
conserved synteny with human chromosome 12pl3.31, which it- 
self resides within a larger RA susceptibility locus [Jawaheer et al., 
2003]. We previously demonstrated association between SNPs in 
this interval and RA in humans [Lorentzen et al., 2007] . Further- 
more, assaying SNPs in this region by the dynamic allele-specific 
hybridization (DASH) genotyping method [Fredman et al., 2004] 
produced semiquantitative readouts that suggested the presence of 
structural variation. 

We now present the discovery and characterization of a large 
CNV within the chrl2pl3.31 interval. Genotyping of this CNV 
by various methods (laboratory and informatics based) in multiple 
population samples produced highly consistent evidence that a dele- 
tion spanning the SLC2A3 (MIM #138170) gene confers substantial 
protection against developing RA. 



Methods 

Samples 

Swedish case and control RA samples used in this study have 
been used for previous investigations [Lorentzen et al., 2007]. All 
RA case samples fulfilled the American Society of Rheumatology 
1987 revised criteria for RA [Arnett et al., 1988]. The Swedish RA 
cohort was made up of 2,403 patients with RA and 1,269 control 
samples [Lorentzen et al., 2007]. Controls were collected from the 
same study area and had similar distribution in age, sex, and res- 
idential area. Anti-citruDinated protein antibodies (ACPA) status 
was available for the Swedish samples of which 64% of cases were 
ACPA positive. ACPA status was not used to stratify the association 
data, as there would have been insufficient power to exclude asso- 
ciation in ACPA-negative samples. In addition ACPA assays do not 
detect all ACPA-positive samples (sensitivity 70%-80%) and have a 
false positive rate of between 4% and 12%. The UK RA case group 
comprised 1,846 RA case samples, and the UK control group com- 
prised 7,355 samples from the 1,958 British Birth cohort collection 
as has been previously described by the Wellcome Trust Case Con- 
trol Consortium [Wellcome Trust Case Control Consortium, 2007] . 
The WTCCC also assessed this sample for population stratification 
and only a small number of genomic regions exhibited detectable 
stratification across a NW/SE divide. The CNV examined in this 
study did not reside in any of these intervals. The US RA collec- 
tion consisted of 1,967 cases and 996 controls. The RA case subjects 
were enrolled from across the United States as part of the North 
American Rheumatoid Arthritis Consortium (NARAC) collections 
I and II [Gregersen et al. 2009; Plenge et al. 2007], and all subjects 
either met the 1987 American College of Rheumatology criteria 
for diagnosis of RA [Arnett et al., 1988]. As reported previously, 
controls were obtained from a local New York cohort [Mitchell 
et al. 2004] , and matched to cases using ancestry- informative mark- 
ers, as described previously [Gregersen et al. 2009; Plenge et al. 
2007]. 



Oligonucleotide-Array CGH 

Oligonucleotide-array CGH was performed by Nimblegen Inc. 
using 2-/[ig of DNA on a microarray chip of 1 52,452 probes spanning 
3.5 mb of chrl2pl3.31. Log2 ratios of each of the five pairs of DNA 
samples were averaged over 500-bp intervals. 



Human Genome Project Trace Archive and BLAT 
Characterization of Region 

Human genome project (HGP) sequencing traces were 
downloaded from the NCBI Trace Archive (http://www.ncbi. 
nlm.nih.gov/Traces/) and aligned to the reference genome (build 
NCBI36/hgl8, March 2006) using the GSAssembler V2 software 
(Roche, Burgess HiU, West Sussex, UK). 



CNV Genotyping Using the Paralogue Ratio Test 

The paralogue ratio test (PRT) was selected as it has been demon- 
strated to be robust in other studies, more reliable than qPCR 
and has low-DNA quantity requirements [Aldhous et al., 2010; 
Armour et al., 2007; CantsUieris and White, 2013; Fode et al., 
2011; HoUox et al., 2008]. Assay PI (primers PIF: 5'-TATTGCACC 
TTAACCTCTCCAGC-3' and PIR: 5'-CTCACTTCCATACAGCT 
CTACG-3') amplifies two products, one within the 3' untranslated 
region of SLC2A3 (chrl2:8073299-8073582), and one within the 
equivalent region in SLC2A14 (chrl2:7966286-7966484). Partial 
PRT (pPRT) is a modification of PRT that uses three primers in 
each reaction; one primer is matched to both targets, and the two 
remaining primers are each uniquely matched to one target. pPRT 
primers used are listed in Supp. Table SI. PCR reactions were per- 
formed on either 384 or 96 well microtiter plates, case and controls 
were intermixed on these plates. Genetic association studies can be 
susceptible to bias resulting from batch effects due to DNA prepa- 
ration, interlab handling differences, and DNA quality [Clayton 
et al., 2005; lonita-Laza et al., 2009] . We have previously performed 
in-depth investigations into causes of these biases, in particularly 
how they affect PRT [Veal et al., 2012]. We have utilized methods 
developed as part of that research to minimize any such effects, if 
they were present: PCR reactions contained 10 ng DNA, 1 x buffer 
B (Kapa Biosystems, Woburn, MA), 2 M betaine (Sigma-Aldrich, 
Gillingham, Dorset, UK), 0.2 mM dNTP (Roche), 0.15 /xM each 
primer, and 0.02 U Taq DNA polymerase (Kapa Biosystems). PCRs 
were performed in an MBS 0.2G thermal cycler (Thermo Scien- 
tific, Waltham, MA) as follows: 98°C for 1 min; 35 cycles of 98°C 
for 15 sec, annealing temperature for 15 sec and 72° C for 1 min, 
followed by a final extension carried out at 72° C for 5 min. 

ABl PCRs were used to directly assay recombination at the 
ABl sequence. Primers AF and UR detected P1[B] deletions; each 
20 fil reaction contained 20-ng DNA, Ix ll.lx buffer (0.49 M 
Tris-HCl pH8.8, 0.12 M [NH4]2S04), 0.05 M MgClj, 77 mM P- 
mercaptoethanol, 5 fiM EDTA pH8.0, 11.1 mM each nucleotide 
(dATP, dCTP, dGTP, and dTTP), 1.3 mg/ml bovine serum albu- 
min), 0.3 juM each primer, and 0.1 U//otl Tag DNA polymerase (Kapa 
Biosystems). Cycling conditions were: 96°C for 5 min followed by 35 
cycles of 96°C for 30 sec, annealing temperature for 20 sec, and 68°C 
for 4 min. Primers BF and UR were used to investigate P1[B] du- 
plications; each 20-/xl reaction contained 20-ng DNA, 1 x FastStart 
High Fidelity Reaction Buffer (Roche), 0.2 mM dNTPs, 0.4 fiM each 
primer, 0.05 U/fA FastStart High Fidelity Enzyme (Roche). Cycling 
conditions were: 95°C for 2 min, followed by 35 cycles of 95°C for 
30 sec, annealing temperature for 30 sec and 72°C for 4 min. 

PCR products were separated on 300 ml 2% (w/v) Seakem LE 
agarose gels (Lonza, Basel, Switzerland) in Ix Tris-Borate-EDTA 
buffer by electrophoresis. Electrophoresis was performed at 200 V 
for 45 min. Gel images were captured with the GBOX gel docu- 
mentation system (Syngene, Cambridge, UK), and signal intensity 
data for each product were extracted using the GeneSnap software 
(Syngene). 
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Computational Genotyping of the CNVfrom GWAS Data 

PennCNV uses a hidden Markov model utilizing multiple sources 
of information, including allelic ratio distribution and intensity 
data, to genotype CNVs in SNP genotyping data [Wang et al., 
2007]. In this study, 1,971 Swedish samples had been genotyped 
using the lUumina Infinium HapMap 300 SNP chip (lUumina, San 
Diego, CA) and 2,963 US samples that had been genotyped by the 
lUumina Infinium HapMap 370 and 500 chips (lUumina) as pre- 
viously described [Gregersen et al., 2009; Plenge et al., 2007]. For 
all these samples, CNV genotypes were called using the standard 
PennCNV settings including adjustment for GC waves. As the ac- 
curacy of PennCNV is dependent on the size of the CNV, number 
of SNPs, and quality of SNP genotyping, we plotted the B allele 
frequency (BAF) deviation from expected values for heterozygous 
SNPs against mean Log R ratio (LRR) across the CNV to visually 
assess the clustering of CNV calls (Supp. Fig. SI). For variant sam- 
ples, there is a clear distinction from normal samples indicating that 
PennCNV can accurately caU this CNV. This is seen particularly 
for the deletion samples in which there is no overlap with normal 
samples. For the Swedish samples, PI genotyping was avaUable for 
1,475 of the samples genotyped by PennCNV. For the US samples, 
for which whole-genome chip-based genotyping data were avaUable, 
all variant samples detected by PennCNV were inspected visually to 
confirm the presence of a deletion aUele. 

Data Analysis 

The ratio of the two products from a PRT or pPRT was calculated 
by dividing the peak signal intensity of the product from within 
repeat unit B by the peak signal intensity of the product from within 
repeat unit A. For assay PI, data for each row of a gel were nor- 
malized by multiplying by the reciprocal of the median for each 
row of samples. Normalized ratios were transformed by log2 for 
further analysis. Samples were categorized according to expected 



ratios (see Results) with boundaries determined by visual inspec- 
tion of the spread of ratios plotted using the statistical package R [R 
Development Core Team, 2008] . The significance of any differences 
in frequency between Swedish case and control samples was deter- 
mined using the two-tailed chi-squared test on a 2 x 2 contingency 
table. As our initial data had indicated that the frequency of the 
P1[B] deletion allele was much lower in the UK and US samples 
(therefore reduced power to detect association), we combined the 
US and UK data to maximize power. Given the two populations 
are distinct both geographically and in method of genotyping, we 
used the Mantel-Haenszel meta-analysis of the odds ratio. The va- 
lidity of pooling the odds ratios was confirmed using Woolf s test 
for heterogeneity. 

Results 

Identification and Characterization of a CNV at chr12p13.31 

Oligonucleotide-array CGH of the chrl2pl3.31 region was per- 
formed across 10 samples, which were selected based upon DASH 
genotyping patterns. One oligonucleotide-array CGH experiment, 
conducted upon a sample pair on chromosome 12 and selected 
based on DASH genotyping patterns, revealed a large (>100 kb) 
copy-number change spanning the gene SLC2A3, the pseudogene 
NANOGPl and part of the gene SLC2A14 (Fig. 1). By compre- 
hensive long-range PGR plus next-generation sequencing, mining 
of public trace archives, and targeted gap-closure experiments via 
short-range PGR, we established that the 12pl3.31 CNV in question 
entails the gain/loss of one net repeat unit from a two copy tandem 
duplication (Fig. 2A). The two repeat units, which we termed "A" 
and "B" (~100 and 145 kb, respectively), have very diverged pat- 
terns of repetitive elements, in particular Alu elements (Fig. 2B), but 
are otherwise around 95% simUar at the DNA sequence level (stan- 
dard deviation 4%, minimum 80%, maximum 100%, interquartUe 
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Figure 1. Oligonucleotide-array CGH identifies a CNV on chromosome 12p13.31. Positions of genes are displayed on a UCSC Genome Browser 
track. Nimblegen oligonucleotide-array CGH data are plotted beneath this track, each point is the average log2 ratio of probes in 500-bp windows. 
A section (>100 kb, shaded) of points deviate from the expected logz of 0 indicating the presence of a copy-number difference between the tested 
samples. 
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Figure 2. Tandem repeat spans the CNV at chr 12p13.31. A: The position of a tandem repeat at 12p13.31, identified through examination of 
sequence data, is shown overlaid onto a UCSC genome browser track displaying the genes and repetitive elements within this region. The two 
repeat units, "A" and "B" are delimited by the red and blue boxes, respectively. The position of the CNV identified by the Nimblegen oligonucleotide- 
array CGH is shaded in gray and represents a net gain/loss of one repeat unit from the two unit tandem repeat structure. B: The two repeat units 
are similar in sequence, but have many differences in the repetitive element structure. To illustrate this, equivalent sections of sequence from each 
unit are presented with the position of genes and repetitive elements taken from UCSC genome browser. Pink shading and dotted lines represent 
regions of sequence that are similar in both units. It can be seen that differences between the two sections are mainly due to different repetitive 
elements. 



Allele 



NAHR 



Aligned structure 



Ratio Log, 



P1[A1 



P1[B) 



Normal chromosome 



2:2 



Deletion spanning P1[B] 



1:2 



-1 



Duplication spanning P1[B] 




3:2 



0.58 



Deletion spanning P1[A] 



2:1 



Duplication spanning P1[A] 




2:3 



-0.58 



Figure 3. Relative ratios generated by CNV alleles forthe PI assay. Effects of possible CNV alleles on assay PI are displayed on a diagrammatic 
portrayal of the tandem repeat. Four theoretical allele types, deletion or duplication spanning PI [A] or P1[B], are displayed below the representation 
of the tandem repeat on a normal chromosome. The two units of the tandem repeat are presented, "A" in red and "B" in blue. The positions of each 
of the targets of the PI assay are marked (PI [A] and P1[B]). An example NAHR event is displayed for each variant allele. The aligned structure 
column displays the variant chromosome with deleted or duplicated material aligned to the normal chromosome and the effects on each locus of 
the PI assay. The ratios of PIB/PIAfor a heterozygous individual are displayed for diploid genome ratio, and the log2 of the ratio. 
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range 92%-98%, 100 bp windows free of repetitive elements). The 
tandem duplication is seen to be present in other higher primates 
(Chimpanzee buUd CGSC 2.1.3/panTro3; Rhesus Macaque buUd 
MGSC Merged 1.0/rheMac2) but not in other mammals such as 
mice (build NCBI37/mm9). From estimated dates of simian diver- 
gence and historic periods of Alu expansion, it may be likely that 
the ancestral duplication event occurred at least 50 million years 
ago [International Human Genome Sequencing Consortium et al., 
200 1 ]. Four known genes reside within the tandem repeat: NANOG 
(MIM #607937) encoding a transcription factor expressed in embry- 
onic stem cells and a key factor in the maintenance of pluripotency 
[Mitsui et al., 2003], SLC2A14 (MIM #611039) encoding GLUT14 
a glucose transporter expressed specifically in the testes [Wu and 
Freeze, 2002], NANOGPl that is a transcribed but untranslated 
pseudogene of NANOG, and SLC2A3 encoding GLUT3, a glucose 
transporter with the highest affinity for glucose among the family 
of GLUT proteins, which is expressed in various tissues, including 
chondrocytes, and plays an essential role in embryonic development 
[Schmidt etal., 2009]. 

CNV Assay Development 

To investigate this CNV further, we employed assays based upon 
the PRT — a method that in typical scenarios uses one pair of PGR 
primers to coamplify (and hence allow quantitative comparison of) 
both a test region (whose copy number is being assessed) and a stable 
single copy reference region [Armour et al., 2007]. However, since 
the high degree of sequence identity between the two repeat units 
and the high density of repeat elements in this region precluded 
the use of a standard PRT, we modified the concept to instead am- 
plify equivalent but differently sized segments from each unit of 
the tandem duplication. The unit B/unit A ratio of products in this 
case was then taken to indicate relative changes in copy number 
between the two units at the sites being amplified, rather than abso- 
lute copy-number values. Several assays were initially designed and 
optimized on test DNAs. The most robust of these, assay PI, am- 
plifies sequences from chrl2:7966286-7966484 (PI [A]) and from 
chrl2:8073299-8073582 (P1[B]) (positions according to GRCh37). 
Figure 3 illustrates how the position of potential single-unit deletion 
or expansion of this tandem repeat (assuming a simple interunit re- 
combination mechanism of creation) would affect the product ratio 
P1[B]/P1[A]. 

In evaluating the PI assay, 95 Swedish control samples were 
genotyped in triplicate, and this convincingly revealed five samples 
for which the P1[B]/P1[A] log2 ratio was substantially greater or 
less than the value of zero (expected for genomes diploid for the 
"normal" two unit arrangement). Additionally, 12 CEPH DNAs 
were genotyped in four replicates, and the results were highly 
similar in each repetition with clear separation of variant and 
normal samples (Supp. Fig. S2). 

Determining Original Nonallelic Homologous 
Recombination Recombination Sites 

Genotyping using PI across 3,794 UK control samples revealed 
tight clustering of log2 values around those expected for deletions or 
duplications spanning the P1[B] locus (Supp. Fig. S3). CNVs involv- 
ing segmental duplications of this size are typically derived by non- 
allelic homologous recombination (NAHR) [Conrad et al., 2010]. 
Underlying cross-over sites can be identified by assaying the relative 
abundances of the tandem duplication sequences in individuals car- 
rying the variant chromosomes. As illustrated in Figure 4, by using 
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0.442" (0.258-0.755) 









''X^Pvalue = 0.0012. 



a set of "pPRTs" (as explained in Methods), specifically designed to 
quantify abundance ratios for a series of loci across the two tandem 
repeat elements, we demonstrated that two of two Yoruban HapMap 
samples with an extra PI [B] copy were generated by NAHR events 
within an interval of ^8.8 kb (between B5[A] and B6[A]) in the A 
unit and ^6.1 kb in the B unit (between B5[B] and B6[B]). In con- 
trast, three of three European samples with an extra P 1 [B] copy, and 
three of three European samples lacking one PI [B] copy, all seem to 
have been generated by NAHR between pPRTs B9 and BIO. Targeted 
PGR across this latter ancestral breakpoint refined this interval to an 
1,100 bp stretch of region, which we termed ABl, shared between 
the two units (A:7995630-7996700, B:8124315-8125390 [genome 
buUd GRCh37])(Supp. Fig. S4). This is by far the main breakpoint 
in UK samples, in that it was shown to be present in 96.5% of 308 
confirmed variant UK DNAs assessed by pPRTs and targeted PGRs. 
Rearrangements with ABl breakpoints would directly impact the 
two glucose transporter genes; SLC2A3 would be completely deleted 
or duplicated, and SLC2Ai4 would be partially duplicated or deleted 
(Fig. 5). HGVS nomenclature for the variants is as follows; deletion: 
chrl2.hgl9:g.(7995600_7996800)_(8124300_8126400)del, duplica- 
tion: chrl2.hgl9:g.(7995600_7996800)_(8124300_8126400)dup). 

CNV Genotyping and Association with RA 

To test the 12pl3.31 CNV for association with RA, assay PI was 
used to examine a Swedish cohort of 2,403 RA cases and 1,269 
controls. Genotypes were categorized according to the expected 
ratios given in Figure 3, with boundaries determined by clustering 
of P1[B]/P1[A] log2 scores. We made the assumption that each 
individual will have at least one chromosome with the normal allele 
(likely to be true in almost all subjects, given the low frequency of 
CNV alleles). As summarized in Table 1, the count of genotypes 
having a P1[B]/P1[A] log2 ratio <-0.75 (i.e., deletion variants that 
remove the PI [B] locus) is significantly reduced in cases compared 
with controls (P = 0.0012). As expected, the mean log2 values for 
samples with PI [B] deletions were similar between cases (-1.16) and 
controls (-1.08). For this CNV allele, the odds ratio is 0.442 (95% 
CI 0.258-0.755), indicating that individuals with a deletion of the 
region spanning PI [B] are 2-2.5-fold less likely to develop RA. To 
assess the impact of potential misclassification of the CNV alleles, 
the boundaries for the P1[B] deletion were varied. The P values 
were seen to remain significant at the 5% level even when extreme 
thresholds for group classification were applied (Supp. Fig. S5). 

Replication in UK/US Sample Collections 

A replication study was genotyped for the deletion variant of the 
CNV using PI in 9,201 UK samples (7,355 controls and 1,846 cases) 
and using PennCNV in 2,963 US samples (996 controls and 1,967 
cases). Due to power considerations (smaller size of the case or con- 
trol materials and a lower frequency of deletion variants in UK/US 
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Figure 4. Locating ancestral NAHR events. A: The location of sequences amplified by PRT PI and pPRTs B5, B6, B8, B9, and BIO are indicated 
on the tandem repeat (unit A in red, unit B in blue), along with the four genes in the region with arrows showing the direction of transcription. B: 
The chart-labeled "Increased P1[B]/P1[A]" displays five samples whose previous PI assay data suggested the presence of an extra copy of the 
P1[B] sequence on one chromosome, plus a normal sample (purple). Ancestral NAHR events can be taken to have occurred in intervals flanked 
by assays with markedly different relative abundances of unit B and unit A sequences (see Fig. 3). The blue and orange samples are two Yoruban 
DNAs, both of which reveal ancestral NAHR events between the sites of assays B5 and B6. The red, dark blue, and green samples are European in 
origin and reveal NAHR events between markers B9 and BIO. The chart labeled "Decreased P1[B]/P1[A]" displays three samples whose previous 
PI assay data suggested they were missing the P1[B] site from one chromosome, plus a normal sample (purple). All three non-normal samples are 
European in origin and reveal additional ancestral NAHR between markers B9 and BIO. 
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Figure 5. Theoretical products from recombination events at ABl. The two diagrams show the theoretical duplication and deletion products 
of nonallelic homologous recombination at ABl (coordinates according to genome reference GRCh37). Red indicates sequence originating from 
unit A and blue indicates sequence originating from unit B. A: In the chromosomes resulting in duplication, sequence from 7,996,000 to 8,125,000 is 
duplicated. This causes duplication of NANOGPl, SLC2A3, and part of SLC2A14. B: In chromosomes resulting in deletion, sequence from 7,996,000 
to 8,125,000 bp is removed. This causes loss of NANOGPl, SLC2A3, and part of SLC2A14. 
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Table 2. UK and US RA Data 



Genotyping Accuracy Using PI Assay 



UK RA data 



Log2 ratioPl[B]/Pl[Al 


<-0.75 


>-0.75 




PI category 


P1{B) deletion 


Other 


Totals 


Case 


9 (0.49%) 


1,837 (99.51%) 


1,846 


Controls 


67 (0.91%) 


7,288 (99.09%) 


7,355 


Odds Ratio 


0.533* (0.248-1.107) 






US RA data 


PermCNV 


P1(B) deletion 


Other 


Totals 


Case 


1 1 (0.56%) 


1,956 (99.44%) 


1,967 


Controls 


9 (0.90%) 


987 (99.10%) 


996 


Odds Ratio 


0.617' (0.237-1.620) 







"Combined Mantel-Haenszl odds ratio of 0.559 (P = 0.036). 
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Figure 6. Forest plot of odds ratios for Swedish, US, and UK samples 
forthe deletion variant of the CNV. The diagram, generated by the rmeta 
package for R 2.16, displays the odds ratios for each population as a 
box with the 95% confidence intervals indicated by lines. The summary 
represents the 95% confidence interval of the Mantel-Haenszel com- 
bined odds ratio for all three sample collections, the contribution of each 
population to the combined odds ratio is represented by the size of the 
odd ratio box for the corresponding population. 



populations), this disease association analyses considered only the 
putative etiological deletion variant discovered in the Swedish ma- 
terials. Association results supported our initial findings completely 
in terms of direction and effect size of disease risk (Table 2): a 
decreased frequency of genotypes indicating a deletion was appar- 
ent in RA cases compared with controls. This is highly apparent 
when viewed graphically (Fig. 6). Given the UK and US populations 
are distinct, both geographically and in method of genotyping we 
used the Mantel-Haenszel meta-analysis to calculate a pooled odds 
ratio of 0.559 (95% CI 0.323, 0.966; P= 0.036). Importantly, Woolfs 
test indicated that there is no evidence against homogeneity between 
the two datasets. 



The accuracy and robustness of the PI assay and SNP intensity 
data are critical to the validity of the disease association we herein 
report. We applied experimental designs to minimize technical bias, 
described in Methods, and plots of the genotyping measurements 
(PI [B]/PI [A] ratio, mean LRR, and mean BAF) do not provide any 
evidence for bias between cases and controls (Supp. Figs. SI and S6). 
In addition, we reassessed a subset of 368 samples from the 1,958 
UK controls including all 67 samples for which initial genotyping 
indicated a PI [B] deletion, and a random set of 301 DNAs scored as 
having neither a deletion nor an insertion. These samples were reex- 
amined by genotyping again using a combination of repeating the 
PI assay, using the five independent pPRTs (which had proven abil- 
ity to detect this CNV in determination of original NAHR events), 
and direct assessment using the ABl deletion/duplication-specific 
assays. Of the 67 deletion samples, 63 were confirmed by AB I assays, 
of the remaining four, three were confirmed by 100% replication in 
the five pPRTs and in two repeats of the P I assay. Therefore, only one 
of the 67 deletion samples was found to be a misclassified normal 
sample — giving a false-positive deletion assignment rate of 1/7,289 
(since 7,288 samples were initially scored as nondeletions). Of the 
301 normal copy- number samples, none were found to be a mis- 
classified deletion variant sample in either the ABI assay or in at 
least two repeats of the PI assay — giving a false-negative deletion 
assignment rate of 0/301. These data, representing multiple levels 
of fully independent assay validation, indicate that the overall mis- 
classification rates are extremely low, and certainly not sufficient to 
invalidate the discovered disease association. 

To provide further quality control for the PI assay, we compared 
PI genotyping results with a set of copy-number assignments gen- 
erated by a non-PCR-based technology. Specifically, for 1,475 of 
the Swedish samples genotyped with PI high-quality lUumina 300k 
HapMap SNP genotyping data were available, enabling us to call the 
CNV alleles using the PennCNV algorithm [Wang et al., 2007]. The 
concordance rate between the PI assay and PennCNV was 99.7%. 
The 0.3% discrepancy consists of both the very small error rate 
described in the UK controls and the error rate of the PennCNV 
algorithm. Nevertheless, if one were to assume the entire 0.3% orig- 
inated from the P I assay, the association in the Swedish study would 
remain significant. 



Discussion 

We have described the discovery of a CNV at I2pI3.3I that in- 
volves the gain/loss of one net unit (as portions of adjacent units) 
of a normal two unit tandem repeat, and the association of the 
deletion allele with RA protection (combined analysis [Sweden, 
US, and UK] Mantel-Haenszel OR = 0.497 [95%CI 0.341,0.725], 
P = 0.000 1 94) . This deletion partially disrupts SLC2A1 4 and entirely 
deletes SLC2A3 and NANOGPl. Since NANOGPl is expressed but 
untranslated and SLC2A14is only expressed in the testes, they are not 
obvious candidates for a direct role in RA. In contrast, the GLUT3 
product of SLC2A3 plays an important role in two key areas relevant 
to RA: the immune response and chondrocyte function. Related to 
immune response, activated T- and B-ceUs, as well as macrophages, 
are present in RA-affected synovial joints. A 3.5-6-fold increase in 
the expression of GLUT3 is seen in activated T- and B-cells, and 
monocyte to macrophage differentiation is associated with an in- 
crease in GLUT3 expression. This increased GLUT3 expression in 
macrophages is maintained after transformation to foam cells and 
is thought to provide fuel for the immune response, in addition to 
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allowing leukocytes to compete for sugars in low- interstitial glu- 
cose concentrations [Fu et al., 2004; Maratou et al., 2007]. Related 
to chondrocyte function, glucose plays a critical role in chondro- 
cyte metabolism and physiology, and GLUTl, GLUT3, and GLUT9 
are all expressed in normal chondrocytes. GLUTS is essential for 
facilitated diffusion of glucose into chondrocytes [Mobasheri et al., 
2008] . Chondrocytes are involved in RA disease progression through 
destruction of the extracellular matrix. Evidence for this comes 
from the exclusive production of the collagen and proteoglycan 
proteinase MMP-1 by chondrocytes in diseased joints, and from 
arthritis mouse models where an increased level of cartilage damage 
was seen when apoptosis of chondrocytes was prevented [Ainola 
et al., 2005; Barksby et al., 2006; Butler et al., 1997; Otero and 
Goldring, 2007] . It has also been proposed that chondrocytes them- 
selves may be a source of pro-inflammatory cytokines, which aid 
joint destruction by increasing the breakdown of tissue and sup- 
pressing repair mechanisms. As a result, cartilage is degraded faster 
than it can be repaired, leading to destruction of the joint [Otero 
and Goldring, 2007]. With this in mind, we hypothesize that the 
protective effect of a PI [B] deletion genotype of the 12pl3.31 CNV 
may be due to a decreased capability of individuals with this variant 
to express SLC2A3. This would lead to impairment of the immune 
response at the synovium, limitation in the ability of chondrocytes 
to respond to immune signaling and degrade cartilage, or a combi- 
nation of both mechanisms. 

It may be asked why this variant has remained undetected in 
recent large-scale GWAS involving RA. There are three explana- 
tions for this. First, GWAS are designed to test the common vari- 
ant common disease hypothesis, that is, they rely on LD between 
common markers and common causal variants (minor allele fre- 
quency (MAF) > 5%), and not low-frequency causal alleles. Second, 
HapMap data comparing SNPs within and neighboring the CNV, 
and our own data comparing CNV alleles to our previous SNP geno- 
typing, revealed no LD with neighboring SNP alleles with MAF > 5%. 
This is as expected for multiaUelic low-allele frequency CNVs ac- 
cording to published large-scale CNV studies [Conrad et al., 2010]. 
Third, even if the above two problems did not exist, previous RA 
GWAS have not employed sufficient samples to have power to detect 
this locus after correcting for multiple testing. 

The effect size of the association we have detected is greater than 
that for any other RA locus previously described, with the excep- 
tion of the HLA genes. It is also as large as any previously reported 
CNV association with any common disease. Additionally, since the 
mechanism we propose entails a loss-of-function allele that is dis- 
ease protective, this recommends it as a target for drug develop- 
ment, that is, the inhibition of SLC2A3 expression (and/or GLUTS 
activity) may provide a direct means to protect against RA in the 
97%-99% of individuals without the deletion allele. Furthermore, 
given the tendency for autoimmune disorders to share susceptibil- 
ity loci, and the role of SLC2A3 in the immune response, genetic 
variation in this region could also be important in other immune- 
related disorders. Finally, we note that GLUTS has been implicated 
by altered expression in a number of diseases-including dyslexia, 
Alzheimer's disease, schizophrenia, and Huntingtons disease, and 
increased expression of glucose transporters (in particular GLUTl 
and GLUTS) is also a characteristic feature of cancer cells [ Macheda 
et al., 2005; Yamamoto et al, 1990]. We therefore posit that the 
CNV we have described here may impact the risk of many and var- 
ious other diseases, and suggest this merits urgent and thorough 
examination. 



Acknowledgments 

This research was generously supported by Action Medical Research. The 
authors acknowledge the participating patients with RA, controls and all 
rheumatologists for recruiting patients in the EZRA study (Swedish Epi- 
demiological Investigation of Rheumatoid Arthritis) and from the UK (pro- 
vided by Prof. Jane Worthington). US patient and control collections at 
the Feinstein Institute were supported by the US National Institutes of 
Health grant R01-AR44422, with generous support from the Eileen Lud- 
wig Greenland Center for Rheumatoid Arthritis. This work made use of data 
and samples generated by the 1958 Birth Cohort (http://www2/le. ac.uk/ 
projects/birthcohort; http://www.bristol.ac.uk/alspac/; http://www.cls.ioe. 
ac.uk/ncds; http://www.esds.ac.uk/findingData/ncds.asp) under grant 
G0000934 from the Medical Research Council, and grant 068545/Z/02 from 
the Wellcome Trust. 



References 

Ainola MM, Mandelin JA, LHjestrom MP, Li TF, Hukkanen MVJ, Konttinen YT. 2005. 
Pannus invasion and cartilage degradation in rheumatoid arthritis: involvement 
of MMP-3 and interleukin-lbeta. Clin Exp Rheumatol 23:644-650. 

Aldhous MC, Abu Baker S, Prescott NJ, PaUa R, Soo K, Mansfield JC, Mathew CG, 
Satsangi J, Armour JAL. 2010. Measurement methods and accuracy in copy num- 
ber variation: failure to replicate associations of beta-defensin copy number with 
Crohn's disease. Hum Mol Genet 19:4930^938. 

Armour lAL, Palla R, Zeeuwen PLfM, Den Heijer M, Schalkwijk J, Hollox EJ. 2007. 
Accurate, high-throughput typing of copy number variation using paralogue ratios 
from dispersed repeats. Nucleic Acids Res 35:el9. 

Arnett FC, Edworthy SM, Bloch DA, McShane DJ, Fries IF, Cooper NS, Healey LA, 
Kaplan SR, Liang MH, Luthra HS. 1988. The American Rheumatism Association 
1987 revised criteria for the classification of rheumatoid arthritis. Arthritis Rheum 
31:315-324. 

Barksby HE, Hui W, Wappler I, Peters HH, Milner JM, Richards CD, Cawston TE, 
Rowan AD. 2006. Interleukin- 1 in combination with oncostatin M up-regulates 
multiple genes in chondrocytes: implications for cartilage destruction and repair. 
Arthritis Rheum 54:540-550. 

Begovich AB, Carlton VEH, Honigberg LA, Schrodi SJ, Chokkalingam AP, Alexander 
HC, Ardlie KG, Huang Q, Smith AM, Spoerke JM, Conn MT, Chang M, et al. 2004. 
A missense single-nudeotide polymorphism in a gene encoding a protein tyrosine 
phosphatase (PTPN22) is associated with rheumatoid arthritis. Am J Hum Genet 
75:330-337. 

Butler DM, Malfait AM, Mason LJ, Warden PJ, KoUias G, Maini RN, Feldmann M, 
Brennan FM. 1997. DBA/1 mice expressing the human TNF-alpha transgene 
develop a severe, erosive arthritis: characterization of the cytokine cascade and 
cellular composition. J Immunol 159:2867-2876. 

Cantsilieris S, White SJ. 2013. Correlating multiallelic copy number polymorphisms 
with disease susceptibility. Hum Mutat 34:1-13. 

Clayton DG, Walker NM, Smyth DJ, Pask R, Cooper JD, Maier LM, Lam AC, Ovington 
NR, Stevens HE, Nutland S, Howson JM, Faham M, et al. 2005. Population struc- 
ture, differential bias and genomic control in large-scale, case-control association 
study. Nat Genet 37:1243-1246. 

Conrad DF, Pinto D, Redon R, Feuk L, Gokcumen O, Zhang Y, Aerts J, Andrews 
TD, Barnes C, Campbell P, Fitzgerald T, Hu M, et al. 2010. Origins and func- 
tional impact of copy number variation in the human genome. Nature 464:704- 
712. 

Dickson SP, Wang K, Krantz I, Hakonarson H, Goldstein DB. 2010. Rare variants create 

synthetic genome-wide associations. PLoS Biol 8:12. 
Fredman D, White SJ, Potter S, Eichler EE, Den Dunnen JT, Brookes AJ. 2004. Complex 

SNP-related sequence variation in segmental genome duplications. Nat Genet 

36:861-866. 

Eyre S, Ke X, Lawrence R, Bowes J, Panoutsopoulou K, Barton A, Thomson W, 
Worthington J, Zeggini E. 2012 Examining the overlap between genome-wide 
rare variant association signals and linkage peaks in rheumatoid arthritis. Arthri- 
tis Rheum 63:1522-1526. 

Fode P, Jespergaard C, Hardwick RJ, Bogle H, Dodoo D, Lenicek M, Vitek L, Freitas 
J, Andersen PS, Hollox EJ. 2011. Determination of beta-defensin genomic copy 
number in different populations: a comparison of three methods. PLoS One 
6:el6768. 

Fu Y, Maianu L, Melbert BR, Garvey WT. 2004. Facilitative glucose transporter gene 
expression in human lymphocytes, monocytes, and macrophages: a role for GLUT 



HUMAN MUTATION, Vol. 35, No. 2, 248-256, 2014 



255 



isoforms 1, 3, and 5 in the immune response and foam cell formation. Blood Cells 
Mol Dis 32:182-190. 

Gonzalez E, Kulkarni H, Bolivar H, Mangano A, Sanchez R, Catano G, Nibbs R, 
Freedman B, Quinones M, Bamshad M, Murthy K, Rovin B, et al. 2005. The 
influence of CCL3L1 gene-containing segmental duplications on HIV-l/AIDS 
susceptibility. Science 307:1434-1440. 

Gregersen P, Silver J, Winchester R. 1987. The shared epitope hypothesis. Arthritis 
Rheum 30:1205-1213. 

Gregersen PK, Lee H-S, Batliwalla F, Begovich AB. 2006. PTPN22: setting thresholds 
for autoimmunity. Semin Immunol 18:214-223. 

Gregersen PK, Amos CI, Lee AT, Lu Y, Remmers EF, Kastner DL, Seldin MF, Criswell 
LA, Plenge RM, Holers M, Mikuls TR, Sokka T, et al. 2009. REL, encoding a 
member of the NF-kB family of transcription factors, is a newly defined risk locus 
for rheumatoid arthritis. Nat Genet 41:820-823. 

Hollox EJ, Huffmeier U, Zeeuwen PLJM, Palla R, Lascorz J, Rodijk-Olthuis D, Van 
De Kerkhof PCM, Traupe H, De Jongh G, Den Heijer M, Reis A, Armour JAL, 
Schalkwijk J. 2008. Psoriasis is associated with increased beta-defensin genomic 
copy number. Nat Genet 40:23-25. 

International Human Genome Sequencing Consortium, Lander ES, Linton LM, Birren 
B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, 
Funke R, et al. 2001. Initial sequencing and analysis of the human genome. Nature 
409:860-921. 

lonita-Laza I, Rogers AJ, Lange C, Raby BA, Lee C. 2009. Genetic association analysis of 
copy-number variation (CNV) in human disease pathogenesis. Genomics 93:22- 
26. 

Itsara A, Cooper GM, Baker C, Girirajan S, Li J, Absher D, Krauss RM, Myers RM, 

Ridker PM, Chasman DI, Mefford H, Ying P, Nickerson DA, Eichler EE. 2009. 

Population analysis of large copy number variants and hotspots of human genetic 

disease. Am J Hum Genet 84:148-161. 
Jansson AM, Jacobsson L, Luthman H, Lorentzen JC. 1999. Susceptibility to oil-induced 

arthritis is linked to Oia2 on chromosome 4 in a DA(DA x PVG.lAVl) backcross. 

Transplant Proc 31:1597-1599. 
Jawaheer D, Seldin MF, Amos CI, Chen WV, Shigeta R, Etzel C, Damle A, Xiao X, Chen 

D, Lum RF, Monteiro J, Kern M, et al. 2003. Screening the genome for rheumatoid 

arthritis susceptibility genes: a replication study and combined analysis of 512 

multicase families. Arthritis Rheum 48:906-916. 
Klareskog L, Stolt P, Lundberg K, Kallberg H, Bengtsson C, Grunewald J, Ronnelid J, 

Harris HE, Ulfgren A-K, Rantapaa-Dahlqvist S, Ekiund A, Padyukov L, Alfredsson 

L. 2006. A new model for an etiology of rheumatoid arthritis: smoking may trigger 

HLA-DR (shared epitope} -restricted immune reactions to autoantigens modified 

by citruUination. Arthritis Rheum 54:38-46. 
Lorentzen JC, Flornes L, Ekiow C, Backdahl L, Ribbhammar U, Guo JP, Smolnikova 

M, Dissen E, Seddighzadeh M, Brookes AJ, Alfredsson L, Klareskog L, et al. 2007. 

Association of arthritis with a gene complex encoding C-type lectin-like receptors. 

Arthritis Rheum 56:2620-2632. 
Lorentzen JC, Glaser A, Jacobsson L, Galli J, Fakhrai-Rad H, Klareskog L, Luthman 

H. 1998. Identification of rat susceptibility loci for adjuvant-oil-induced arthritis. 

Proc Nat Acad Sci USA 95:6383-6387. 
MacGregor AJ, Snieder H, Rigby AS, Koskenvuo M, Kaprio J, Aho K, Silman AJ. 2000. 

Characterizing the quantitative genetic contribution to rheumatoid arthritis using 

data from twins. Arthritis Rheum 43:30-37. 
Macheda ML, Rogers S, Best JD. 2005. Molecular and cellular regulation of glucose 

transporter (GLUT) proteins in cancer. J Cell Physiol 202:654-662. 
Maratou E, Dimitriadis G, Kollias A, Boutati E, Lambadiari V, Mitrou P, Raptis SA. 

2007. Glucose transporter expression on the plasma membrane of resting and 
activated white blood cells. Eur J Clin Invest 37:282-290. 

McKinney C, Merriman ME, Chapman PT, Gow PJ, Harrison AA, Highton J, Jones 
PBB, McLean L, O'Donnell JL, Pokorny V, Spellerberg M, Stamp LK, et al. 

2008. Evidence for an influence of chemokine ligand 3-like 1 (CCL3L1) gene 
copy number on susceptibility to rheumatoid arthritis. Ann Rheum Dis 67: 
409-413. 

Mitchell MK, Gregersen PK, Johnson S, Parsons R, Vlahov D, New York Cancer Project. 

2004. The New York Cancer Project: rationale, organization, design, and baseline 

characteristics. J Urban Health 81:301-310. 
Mills RE, Walter K, Stewart C, Handsaker RE, Chen K, Alkan C, Abyzov A, Yoon SC, 

Ye K, Cheetham RK, ChinwaUa A, Conrad DF, et al. 201 1. Mapping copy number 

variation by population-scale genome sequencing. Nature 470:59-65. 



Mitsui K, Tokuzawa Y, Itoh H, Segawa K, Murakami M, Takahashi K, Maruyama M, 
Maeda M, Yamanaka S. 2003. The homeoprotein Nanog is required for mainte- 
nance of pluripotency in mouse epiblast and ES cells. Cell 113:631-642. 

Mobasheri A, Bondy CA, Moley K, Mendes AF, Rosa SC, Richardson SM, Hoyland 
JA, Barrett-JoUey R, Shakibaei M. 2008. Facilitative glucose transporters in artic- 
ular chondrocytes. Expression, distribution and functional regulation of GLUT 
isoforms by hypoxia, hypoxia mimetics, growth factors and pro-inflammatory 
cytokines. Adv Anat Embryol Cell Biol 200: 1-84. 

Morgan AW, Thomson W, Martin SG, Carter AM, Erlich HA, Barton A, Hocking L, 
Reid DM, Harrison P, Wordsworth P, Steer S, Worthington J, et al. 2009. Reevalu- 
ation of the interaction between HLA-DRBl shared epitope alleles, PTPN22, and 
smoking in determining susceptibility to autoantibody-positive and autoantibody- 
negative rheumatoid arthritis in a large UK Caucasian population. Arthritis Rheum 
60:2565-2576. 

Otero, M, Goldring, MB. 2007. Cells of the synovium in rheumatoid arthritis. T lym- 
phocytes. Arthritis Res Ther 9:220. 

Plenge RM, Seielstad M, Padyukov L, Lee AT, Remmers EF, Ding B, Liew A, Khalili H, 
Chandrasekaran A, Davies LRL, Li W, Tan AKS, et al. 2007. TRAF1-C5 as a risk 
locus for rheumatoid arthritis — a genomewide study. N Eng J Med 357: 1 199-1209. 

R Development Core Team. (2008). R: a language and environment for statistical 
computing. R foundation for statistical computing. Vienna, Austria. ISBN 3- 
900051-07-0. 

Raychaudhuri S. 2010. Recent advances in the genetics of rheumatoid arthritis. Curr 

Opin Rheumatol 22:109-118. 
Redon R, Ishikawa S, Fitch KR, Feuk L, Perry GH, Andrews TD, Fiegler H, Shapero 

MH, Carson AR, Chen W, Cho EK, Dallaire S, et al. 2006. Global variation in copy 

number in the human genome. Nature 444:444—454. 
Remmers EF, Plenge RM, Lee AT, Graham RR, Hom G, Behrens TW, De Bakker PIW, 

Le JM, Lee H-S, Batliwalla F, Li W, Masters SL, et al. 2007. STAT4 and the risk of 

rheumatoid arthritis and systemic lupus erythematosus. N Eng J Med 357:977-986. 
Ribbhammar U, Flornes L, Backdahl L, Luthman H, Fossum S, Lorentzen JC. 2003. 

High resolution mapping of an arthritis susceptibility locus on rat chromosome 

4, and characterization of regulated phenotypes. Hum Mol Genet 12:2087-2096. 
Schmidt S, Hommel A, Gawlik V, Augustin R, Junicke N, Florian S, Richter M, Walther 

DJ, Montag D, Joost H-G, Schiirmann A. 2009. Essential role of glucose transporter 

GLUT3 for post-implantation embryonic development. J Endocrinol 200:23-33. 
Silman AJ, Newman J, MacGregor AJ. 1996. Cigarette smoking increases the risk of 

rheumatoid arthritis. Results from a nationwide study of disease-discordant twins. 

Arthritis Rheum 39:732-735. 
Stahl EA, Raychaudhuri S, Remmers EF, Xie G, Eyre S, Thomson BP, Li Y, Kurreeman 

FAS, Zhernakova A, Hinks A, Guiducci C, Chen R, et al. 2010. Genome-wide 

association study meta-analysis identifies seven new rheumatoid arthritis risk 

loci. Nat Genet 42:508-514. 
Thomson W, Barton A, Ke X, Eyre S, Hinks A, Bowes J, Donn R, Symmons D, Hider S, 

Bruce IN, Wilson AG, Marinou I, et al. 2007. Rheumatoid arthritis association at 

6q23. Nat Genet 39: 143 1-1433. 
Veal CD, Freeman PJ, Jacobs K, Lancaster O, Jamain S, Leboyer M, Albanes D, Vaghela 

RR, Gut I, Chanock SJ, Brookes AJ. 2012. A mechanistic basis for amplification 

differences between samples and between genome regions. BMC Genomics 13:455. 
Wang K, Li M, Hadley D, Liu R, Glessner J, Grant SFA, Hakonarson H, Bucan M. 2007. 

PennCNV: an integrated hidden Markov model designed for high -resolution copy 

number variation detection in whole-genome SNP genotyping data. Genome Res 

17:1665-1674. 

Wellcome Trust Case Control Consortium. 2007. Genome-wide association study of 
14,000 cases of seven common diseases and 3,000 shared controls. Nature 447:66 1- 
678. 

Wu X, Freeze HH. 2002. GLUT14, a duplicon of GLUT3, is specifically expressed in 
testis as alternative splice forms. Genomics 80:553-557. 

Yamamoto T, Seino Y, Fukumoto H, Koh G, Yano H, Inagaki N, Yamada Y, Inoue 
K, Manabe T, Imura H. 1990. Over- expression of facilitative glucose transporter 
genes in human cancer. Biochem Biophys Res Commun 170:223-230. 

Yang Y, Chung EK, Wu YL, SaveUi SL, Nagaraja HN, Zhou B, Hebert M, Jones KN, 
Shu Y, Kitzmiller K, Blanchong CA, McBride KL, et al. 2007. Gene copy-number 
variation and associated polymorphisms of complement component C4 in human 
systemic lupus erythematosus (SLE): low copy number is a risk factor for and high 
copy number is a protective factor against SLE susceptibility in European America. 
Am J Hum Genet 80:1037-1054. 



256 



HUMAN MUTATION, Vol. 35, No. 2, 248-256, 2014 



